Search CORE

9 research outputs found

Bangla optical character recognition

Author: Habib S. M. Murtoza
Noor Nawsher Ahmed
Publication venue: School of Engineering and Computer Science (SECS) , BRAC University
Publication date
Field of study

This thesis paper is a partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science and Engineering, BRAC University

BRAC University Institutional Repository

A high performance domain specific OCR for Bangla script

Author: Habib S. M. Murtoza
Hasnat Md. Abul
Khan Mumit
Publication venue: BRAC University
Publication date
Field of study

Includes bibliographical references (page 5).Abstract-Research on recognizing Bengali script has been started since mid 1980’s. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR for recognizing Bengali script. We select the training data set from the script of the specified domain. We choose Hidden Markov Model (HMM) for character classification due to its simple and straightforward way of representation. We examine the primary error types that mainly occurred at preprocessing level and carefully handled those errors by adding special error correcting module as a part of recognizer. Finally we added a dictionary and some error specific rules to correct the probable errors after the word formation is done. The entire technique significantly increases the performance of the OCR for a specific domain to a great extent

BRAC University Institutional Repository

Segmentation free Bangla OCR using HMM: Training and Recognition

Author: Md. Abul Hasnat
Mumit Khan
S. M. Murtoza Habib
Publication venue
Publication date: 01/01/2007
Field of study

The wide area of the application of HMM is in Speech Recognition where each spoken word is considered as a single unit to be recognized from the trained word network. Using this concept some research has been done for character recognition. In this paper, we present the training and recognition mechanism of a Hidden Markov Model (HMM) based multi font supported Optical Character Recognition (OCR) system for Bangla character. In our approach the central idea is separate HMM model for each segmented character or word. We emphasize on word level segmentation and like to consider the single character as a word when the character appears alone after segmentation process is done. The system uses HTK toolkit for data preparation, model training from multiple samples and recognition. Features of each trained character are calculated by applying Discrete Cosine Transform (DCT) to each pixel value of the character image where the image is divided into several frames according to its size. The extracted features of each frame are used as discrete probability distributions that will be given as input parameter to each HMM model. In case of recognition a model for each separated character or word is build up using the same approach. This model is given to the HTK toolkit to perform the recognition using Viterbi Decoding. The experimental result shows significant performance

CiteSeerX

BRAC University Institutional Repository

DEVELOPMENT OF ANNOTATED BANGLA SPEECH CORPORA

Author: Dil Afroza Sultana
Firoj Alam
Mumit Khan
S. M. Murtoza Habib
Publication venue
Publication date: 01/01/2010
Field of study

This paper describes the development procedure of three different Bangla read speech corpora which can be used for phonetic research and developing speech applications. Several criteria were maintained in the corpora development process that includes considering the phonetic and prosodic features during text selection. On the other hand, a specification was maintained in the recording phase as the speaking style is a vital part in speech applications. We also concentrated on proper text normalization, pronunciation, aligning, and labeling. The labeling was done manually – in the present endeavor sentence level labeling (annotation) was completed by maintaining a specification so that it could be expanded in future. Index Terms —Speech corpora, Phonetic research, Speech processin

CiteSeerX

BRAC University Institutional Repository